A tool for interactive Subgroup Discovery

Authors

  • Joel P. Lucas
  • Alípio M. Jorge
  • Fernando Pereira
  • Ana M. Pernas
  • Amauri A. Machado
Abstract

We describe an approach and a tool for the discovery of subgroups within the framework of distribution rule mining. Distribution rules are a kind of association rule particularly suited to the exploratory study of numerical variables of interest. Because the technique is exploratory, a distribution rule mining process typically yields a very large number of patterns. Exploring such results is therefore a complex task, which limits the use of the technique. To overcome this shortcoming we developed a tool, written in Java, which supports subgroup discovery in a post-processing step. The tool engages the analyst in an interactive process of subgroup discovery by means of a graphical interface with well-defined statistical grounds, in which domain knowledge can be used to identify subgroups within the population. We present a case study that analyzes the results of students in a large-scale university admission examination.

Keywords: Data Mining, Subgroup Discovery, Post-processing, Visualization, Association Rules, Distributions.

Similar resources

Interactive Discovery of Interesting Subgroup Sets

Although subgroup discovery aims to be a practical tool for exploratory data mining, its wider adoption is hampered by redundancy and the re-discovery of common knowledge. This can be remedied by parameter tuning and manual result filtering, but this requires considerable effort from the data analyst. In this paper we argue that it is essential to involve the user in the discovery process to so...

Visual Interactive Subgroup Discovery with Numerical Properties of Interest

Subgroup discovery consists in finding subsets of individuals from a given population which have distinctive collective properties with regard to one or more properties of interest. The interest of a subgroup can be objectively assessed using appropriate statistics, but it can also be evaluated by a data analyst or domain expert. In this paper we propose an approach to subgroup discovery via di...

Comparing the Effect of Three Teaching-Learning Approaches on Students' Learning Performance in Biology

The present study was designed to investigate the effects of three teaching-learning approaches (discovery, interactive, and transmission) on students' learning performance in a biology lesson. In this quasi-experimental research, three experimental groups (N1=60, N2=71, N3=63) were used in order to identify any significant difference between the students' learning performance wh...

Novel Techniques for Efficient and Effective Subgroup Discovery

Large volumes of data are collected today in many domains. Often there is so much data available that it is difficult to identify the relevant pieces of information. Knowledge discovery seeks to obtain novel, interesting and useful information from large datasets. One key technique for that purpose is subgroup discovery. It aims at identifying descriptions for subsets of the data, which have ...

Expert Discovery: A web mining approach

Expert discovery is the search for an answer to the question: “Who is the best expert on a specific subject in a particular domain within a peculiar array of parameters?” Experts with domain knowledge in any field are crucial for consulting in industry, academia and the scientific community. The aim of this study is to address the issues of the expert-finding task in real-world communities. Collabor...

Publication date: 2007